Information Subtraction: Learning Representations for Conditional Entropy

Leong, Keng Hou, Xiu, Yuxuan, Chan, Wai Kin

arXiv.org Artificial Intelligence

We may consider the observations as samples from stochastic distributions and use information-theoretic measures, as shown in Figure 1, to quantify the uncertainty and shared information among variables. These measures reveal the strength of relationships between variables, including correlation and Granger causality (Pearl 2009). Beyond merely recognizing the magnitude of such relationships, many representation learning works aim to further explain and describe them, enhancing our understanding of and control over the system (Yao et al. 2021; Xu et al. 2023). These approaches generate representations that maximize information about the targets, as they must be capable of accurately reconstructing the targets (Kingma and Welling 2013; Clark et al. 2019). Therefore, most methods can effectively represent entropy H(Y) or mutual information I(X;Y), which describe the total information of Y and the information shared between X and Y, respectively, as shown in Figure 1. However, fewer methods have addressed the representation of other information terms such as conditional entropy H(Y|X) and conditional mutual information I(X;Y|W), which describe the information in Y not provided by X, and the information that X provides to Y but W does not, respectively. Representing conditional mutual information is significant because it reveals the distinct impact of a specific factor on the target, beyond what other factors provide. For example, identifying the distinct effect of funding on a scholar's publications, separate from other factors, can guide policy decisions such as terminating funding that shows no significant benefit. Furthermore, representing conditional entropy helps in creating fair and unbiased representations by removing the influence of sensitive factors.
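To make the four terms above concrete, here is a minimal plug-in sketch on discrete samples. It is our own illustration, not code from the paper: the function names and the XOR toy example are assumptions, and the naive empirical estimates stand in for the learned representations the paper actually develops.

```python
import numpy as np
from collections import Counter

def entropy(samples):
    """Plug-in estimate of H(.) in bits from a sequence of discrete samples."""
    counts = np.array(list(Counter(samples).values()), dtype=float)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

def conditional_entropy(y, x):
    """H(Y|X) = H(X,Y) - H(X): the information in Y not provided by X."""
    return entropy(list(zip(x, y))) - entropy(x)

def mutual_information(x, y):
    """I(X;Y) = H(Y) - H(Y|X): the information shared by X and Y."""
    return entropy(y) - conditional_entropy(y, x)

def conditional_mutual_information(x, y, w):
    """I(X;Y|W) = H(Y|W) - H(Y|X,W): what X adds about Y beyond W."""
    return conditional_entropy(y, w) - conditional_entropy(y, list(zip(x, w)))

# Toy check: with Y = X XOR W, X alone says nothing about Y,
# but given W it determines Y completely.
rng = np.random.default_rng(0)
x = rng.integers(0, 2, 10_000)
w = rng.integers(0, 2, 10_000)
y = x ^ w
print(mutual_information(x, y))                 # ~0 bits
print(conditional_mutual_information(x, y, w))  # ~1 bit
```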


Disentanglement Analysis with Partial Information Decomposition

Tokui, Seiya, Sato, Issei

arXiv.org Machine Learning

When we recognize objects, sounds, sentences, or anything else perceptible, we quickly comprehend how each differs from others in properties that may individually vary across instances, such as color, shape, texture, pitch, rhythm, writing style, and tone. Such interpretable factors of variation are useful for understanding what constitutes the variation in data and for manipulating data generation when a generative process is available. Disentanglement is a guiding principle for designing a learned representation separable into parts that individually capture the underlying factors of variation. The concept originated as an inductive bias in representation learning towards obtaining representations aligned with the underlying factors of variation in data (Bengio et al., 2013) and has been applied to controlling otherwise unstructured representations of data in several domains, e.g., images (Karras et al., 2019; Esser et al., 2019), text (Hu et al., 2017), and audio (Hsu et al., 2019), to name just a few. While the concept is appealing, a concrete definition of disentanglement is not trivial. Most existing studies after Higgins et al. (2017) proposed generative learning methods that encourage latent variables to be marginally independent of each other; however, it is still not clear whether that is the ultimate direction for better disentanglement (Higgins et al., 2018). To understand disentanglement, it is crucial to design disentanglement metrics that measure how well representations disentangle the true generative factors, since defining such metrics is likewise non-trivial (Higgins et al., 2017; Kim & Mnih, 2018; Chen et al., 2018; Eastwood & Williams, 2018).
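Among the metrics cited above, the Mutual Information Gap (MIG) of Chen et al. (2018) is compact enough to sketch. The code below is a minimal reading of that metric, not the authors' implementation; the binning scheme, function name, and signature are our own choices.

```python
import numpy as np
from sklearn.metrics import mutual_info_score

def mig(factors, codes, n_bins=20):
    """Mutual Information Gap: for each ground-truth factor, the gap
    between the two most informative latent dimensions, normalized by
    the factor's entropy. Scores near 1 mean each factor is captured
    by a single latent dimension. Assumes >= 2 latent dimensions.

    factors: (n_samples, n_factors) array of discrete factors
    codes:   (n_samples, n_latents) array of continuous latent codes
    """
    # Discretize each latent dimension so plug-in MI estimates apply.
    binned = np.stack(
        [np.digitize(c, np.histogram_bin_edges(c, bins=n_bins)[1:-1])
         for c in codes.T],
        axis=1)
    gaps = []
    for k in range(factors.shape[1]):
        f = factors[:, k]
        mis = sorted((mutual_info_score(f, binned[:, j])
                      for j in range(binned.shape[1])), reverse=True)
        h_f = mutual_info_score(f, f)  # entropy of the factor (nats)
        gaps.append((mis[0] - mis[1]) / h_f)
    return float(np.mean(gaps))
```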


A Note on Semantic Web Services Specification and Composition in Constructive Description Logics

Bozzato, Loris, Ferrari, Mauro

arXiv.org Artificial Intelligence

The idea of the Semantic Web is to annotate Web content and services with computer-interpretable descriptions, with the aim of automating many tasks currently performed by human users. In the context of Web services, one of the most interesting tasks is their composition. In this paper we formalize this problem in the framework of a constructive description logic. In particular, we propose a declarative service specification language and a calculus for service composition. We show by means of an example how this calculus can be used to define composed Web services, and we discuss the problem of automatic service synthesis.


Learning Graphical Models with Mercer Kernels

Bach, Francis R., Jordan, Michael I.

Neural Information Processing Systems

We present a class of algorithms for learning the structure of graphical models from data. The algorithms are based on a measure known as the kernel generalized variance (KGV), which essentially allows us to treat all variables on an equal footing as Gaussians in a feature space obtained from Mercer kernels. Thus we are able to learn hybrid graphs involving discrete and continuous variables of arbitrary type. We explore the computational properties of our approach, showing how to use the kernel trick to compute the relevant statistics in linear time. We illustrate our framework with experiments involving discrete and continuous data.
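For intuition, a two-variable version of a KGV-style mutual information can be written directly, though the sketch below is only our dense reading of the idea: the RBF kernel, the regularization constant, and the eigenvalue formulation are assumptions, and the O(n^3) linear algebra here ignores the low-rank kernel trick that gives the paper its linear-time statistics.

```python
import numpy as np

def centered_gram(x, sigma=1.0):
    """Centered RBF Gram matrix of a 1-D sample (an assumed kernel choice)."""
    d2 = (x[:, None] - x[None, :]) ** 2
    K = np.exp(-d2 / (2.0 * sigma ** 2))
    n = len(x)
    H = np.eye(n) - np.ones((n, n)) / n
    return H @ K @ H

def kgv_mutual_information(x, y, sigma=1.0, kappa=1e-2):
    """KGV-style mutual information surrogate between two samples:
    I = -1/2 * sum_i log(1 - rho_i^2), with rho_i the regularized
    kernel canonical correlations. A dense sketch, not the paper's
    linear-time algorithm."""
    n = len(x)
    Kx, Ky = centered_gram(x, sigma), centered_gram(y, sigma)
    # r(K) = K (K + n*kappa/2 I)^{-1}: regularized correlation operator.
    Rx = Kx @ np.linalg.inv(Kx + n * kappa / 2.0 * np.eye(n))
    Ry = Ky @ np.linalg.inv(Ky + n * kappa / 2.0 * np.eye(n))
    # Squared canonical correlations as eigenvalues of Rx^2 Ry^2.
    rho2 = np.linalg.eigvals(Rx @ Rx @ Ry @ Ry).real
    rho2 = np.clip(rho2, 0.0, 1.0 - 1e-12)
    return -0.5 * float(np.sum(np.log1p(-rho2)))
```

In the paper, such KGV terms act as Gaussian-like surrogates for the mutual information quantities used to score candidate graph structures over both discrete and continuous variables.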

